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Abstract 

The topic of this paper is the distributed and incremental generation of long 
executions of concurrent systems, uniformly or more generally with weights 
associated to elementary actions. 

Synchronizing sequences of letters on alphabets sharing letters are known 
to produce a trace in the concurrency theoretic sense, i.e., a labeled partially 
ordered set. We study the probabilistic aspects by considering the synchroniza¬ 
tion of Bernoulli sequences of letters, under the light of Bernoulli and uniform 
measures recently introduced for trace monoids. 

We introduce two algorithms that produce random traces, using only lo¬ 
cal random primitives. We thoroughly study some specific examples, the path 
model and the ring model, both of arbitrary size. For these models, we show 
how to generate any Bernoulli distributed random traces, which includes the 
case of uniform generation. 


1—Introduction 


The developments of concurrency theory and of model checking theory have urged 
the development of a theory of probabilistic concurrent systems. A central issue is 
the problem of the uniform generation of long executions of concurrent systems. Ex¬ 
ecutions of a concurrent system can be represented as partial orders of events. Each 
partial order has several sequentializations, the combinatorics of which is non trivial. 
Therefore, the uniform generation of executions of a concurrent system is much dif¬ 
ferent from the uniform generation of their sequentializations. The later can be done 
using uniform generation techniques for runs of transition systems, at the expense of 
an increasing amount of complexity due to the combinatorics of sequentializations 18 
[9 20 . Yet, it still misses the overall goal of uniform generation among executions of 


the system, since it focuses on their sequentializations. 

We consider the framework of trace monoids 10 , also called monoids with partial 
commutations |^. Trace monoids have been studied as basic models of concurrent 
systems since several decades 17 11 . One uses the algebraic commutation of genera¬ 


tors of the monoid to render the concurrency of elementary actions. The elements of 
the trace monoid, called traces, represent the finite executions of the concurrent sys¬ 
tem 10 . More sophisticated concurrency models build on trace monoids, for instance 


19 


executions of 1-safe Petri nets correspond to regular languages of trace monoids 
The topic of this paper is the effective uniform generation of large traces, i.e., 
large elements in a trace monoid. It has several potential applications in the model 
checking and in the simulation of concurrent systems. 
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In a recent work co-authored with J. Mairesse [^, we have shown that the no¬ 
tion of Bernoulli measure for trace monoids provides an analogous, in a framework 
with concurrency, of classical Bernoulli sequences—i.e., the mathematical model of 
memoryless coin tossing. In particular, Bernoulli measures encompass the uniform 
measure, an analogous for trace monoids of the maximal entropy measure. Therefore 
Bernoulli measures are an adequate theoretical ground to work with for the random 
generation of traces, in particular for the uniform generation. 

Bernoulli sequences are highly efficiently approximated by random generators. An 
obvious, but nevertheless crucial feature of their generation is that it is incremental. 
For the random generation of large traces, we shall also insist that the generation pro¬ 
cedure is incremental. Furthermore, another desirable feature is that it is distributed, 
in a sense that we explain now. 

We consider trace monoids attached to networks of alphabets sharing common 
letters. The synchronization of several sequences of letters on different local alphabets 
sharing common letters is known to be entirely encoded by a unique element of a 
trace monoid. If S denotes the union of all local alphabets, then the synchronization 
trace monoid is the monoid with the presentation by generators and relations M = 
{T, \ ab = ba), where (a, b) ranges over pairs of letters that do not occur in any common 
local alphabet. Hence, seeing local alphabets as “resources”, two distinct letters a and 
b commute in M if and only if they do not share any common resource—a standard 
paradigm in concurrency theory. 

In this framework, our problem rephrases as follows: given a synchronization 
trace monoid, design a probabilistic protocol to reconstruct a global random trace, 
uniformly among traces, and in a distributed and incremental way. By “distributed”, 
we mean that the random primitives should only deal with the local alphabets. The 
expression “uniformly among traces” deserves also an explanation, since traces of a 
monoid are countably many. One interpretation is to fix a size k for target traces, and 
to retrieve a trace uniformly distributed among those of size k. Another interpretation 
is to consider infinite traces, i.e., endless executions of the concurrent system. It 
amounts in an idealization of the case with large size traces. We then rely on the 
notion of uniform measure for infinite traces, which happens to have nicer properties 
than the uniform distribution on traces of fixed size. As explained above, the uniform 
measure belongs to the largest class of Bernoulli measures for trace monoids. Hence, 
a slightly more general problem is the distributed and incremental reconstruction of 
any Bernoulli measure attached to a synchronization trace monoid. 

We introduce two algorithms that partly solve this problem, the Probabilistic Syn¬ 
chronization Algorithm (PSA) and the Probabilistic Full Synchronization Algorithm 
(PFSA), the later building on the former. According to the topology of the network, 
one algorithm or the other shall be applied. We show that the problem of generating 
traces according to a Bernoulli measure is entirely solved for some specific topologies 
of the network, namely for the path topology and for the ring topology—it could 
also be applied successfully to a tree topology. It is only partly solved for a general 
topology. Yet, even in the case of a general topology, our procedure outputs large 
random traces according to a Bernoulli scheme, although it is unclear how to tune 
the probabilistic parameters in order to obtain uniformity. Furthermore, the amount 
of time needed to obtain a trace of size k is linear with k in average. 

Several works analyzing the exchange of information in concurrent systems restrict 
their studies to tree topologies, see for instance 13 , and very few has been said on 
the probabilistic aspects. In particular, designing an incremental and distributed 
procedure to uniformly randomize a system with a ring topology has been an unsolved 
problem in the literature so far. 

How does our method compare with standard generation methods based on tools 
from Analytic Combinatorics, such as Boltzmann samplers techniques? There exists 


2 



a normal form for traces, the so-called Cartier-Foata normal form, from which one 
derives a bijection between traces of a given trace monoid and words of a regular 
language. This seems to draw a direct connection with Boltzmann sampling of words 
from regular languages ( 12 p. 590 ], [^). This approach however suffers from some 


drawbacks. First, the rejection mechanism which is at the very heart of the Boltzmann 
sampling approach prevents the construction to be incremental, as we seek. This 
could be avoided by considering instead the generation of the Markov chain of the 
elements of the normal form associated to the uniform measure on infinite traces, 
as investigated partly in [^. But this leads to a second problem, namely that the 
randomness now concentrates on the set of cliques of the trace monoid, a set which 
size grows exponentially fast with the number of generators of the monoid in general. 
It henceforth misses the point of being a distributed generation. 


Outline. Section illustrates on small examples the two algorithms for generat¬ 
ing random traces from a network of alphabets, the PSA and the PFSA, to be fully 
analyzed in forthcoming sections. Sections and Section are two preliminary sec¬ 
tions, gathering material on the combinatorics of trace monoids for the first one, and 
material on Bernoulli measures for trace monoids for the second one. The non-random 
algorithms for the synchronization of sequences, possibly infinite, that we present in 
Section seem to be new, although the fundamental ideas on which they rest are not 
new. A new result on the theory of Mobius valuations is given at the end of Section]^ 

Our main contributions are organized in Sections and Section is devoted 
to the Probabilistic Synchronization Algorithm (PSA). The analysis of the algorithm 
itself is quite simple and short. The most striking contributions are in the examples. 
In particular, for the surprising case of the path model, the PSA is shown to work the 
best one could expect. 

Section is devoted to the description and to the analysis of the Probabilistic 
Full Synchronization Algorithm (PFSA), both from the probabilistic point of view 
and from the complexity point of view. Examples are examined; the ring model of 
arbitrary size is precisely studied, and we obtain the satisfying result of simulating 
any Bernoulli scheme on infinite traces by means of the PFSA. 

Finally, Section discusses some additional complexity issues and suggests per¬ 
spectives. 


2—Illustrating the PSA and PFSA algorithms 

In this section, we illustrate our two generation algorithms on small examples, as well 
as the tuning of their probabilistic parameters, at an informal and descriptive level. 

2.1 — Illustrating the Probabilistic Synchronization Algorithm (1) 

Let 00,01,02,03,04 be five distinct symbols, and consider the four alphabets: 

Si = {00,01}, ^2 = {01,02}, £3 = {02,03}, E4 = {03,04}. 

To each alphabet is attached a device able to produce a random sequence 
Yi = {Xi,i:Yi,2, ■ ■ ■) of letters Fij € E^. The random letters are independent and 
identically distributed, according to a distribution pi = (pi(oi_i) Pi{ai)) to be 
determined later, but with positive coefficients. Furthermore, we assume that the 
devices themselves are probabilistically independent with respect to each other. For 
instance, the beginnings of the four sequences might be: 

Yi = (oioooooioo...) Y2 = (01O2O2O2O2...) 

I3 = (02O2O3O3O2 ...) I4 = (0403030304 ...) 
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Figure 1 : Heap of pieces corresponding to the vector V 


do fll 02 O3 04 

• - • - • - • - • 


Figure 2 : Synchronization graph of the path model with five generators 


Then we stack these four sequences in a unique vector Y with four coordinates, 
and we join the matching letters from one coordinate to its neighbor coordinates, in 
their order of appearance. The border elements oq and 04 do not match with any 
other element; some elements are not yet matched (for example, the second occurrence 
of Oi in Fi), and we do not picture them. This yields: 


Oi 

1 

Oo 

Oo 

Oi 

02 

1 

02 
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02 

02 


V 04 


...\ 
02 ... 

03 03 02 

I I 

O3 03 . . . / 


Finally, we identify each matching pair with a single piece, labeled with the match¬ 
ing letter and we impose a rotation of the whole picture by a quarter turn counter¬ 
clockwise. We obtain the heap of pieces depicted in Figure It is apparent on this 
picture that pieces Oi and Oi+i share a sort of common resource. Therefore we depict 
the topology of the system resulting from the synchronization of the network of al¬ 
phabets (El, E2, II3, S4) in Figure]^ by drawing an undirected graph with pieces as 
vertices, and an edge between two pieces whenever they share a common resource, or 
equivalently, whenever they both appear in the same alphabet. 

The PSA algorithm consists, for each device attached to each alphabet E^, to do 
the following: 1 ) Generate its own local random sequence; 2 ) Communicate with the 
neighbors in order to tag the matching of each of the occurring letters. 

Since the coefficients pi{ai-i) and Pi{ai) are positive, each coordinate Yi has in¬ 
finitely many occurrences of both letters oi-i and a^. Furthermore, each occurrence 
of a letter Oj with j 7^ 0 ,4 will eventually match a corresponding occurrence in a 
neighbor coordinate. Therefore the random heap that we obtain, say is infinite 
if the algorithm keeps running forever. The PSA produces incremental hnite ap¬ 
proximations of f. The probabilistic analysis of the PSA consists in determining the 
probability distribution of the theoretical inhnite random heap f that the PSA would 
produce if it was to run indefinitely. 

In order to characterize the probability distribution of we shall denote by f 
for each hnite possible heap x, the probabilistic event that ^ starts with x. This means 
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that the hnite heap x can be seen at the bottom of Equivalently: ^ € t ifj after 
waiting long enough, the finite heap x can be seen at the bottom of the current heap 
produced by the PSA. 

We aim at evaluating the probability P(t x) for every finite heap x. For this, 
consider an arbitrary possible sequentialization of x, seen as a successive piling of 
different occurrences of the elementary pieces, which we write symbolically as x = 
Xi ■ ... ■ Xk with Xj G {og, Oi, 02, 03, 04}. Then P( f x) is obtained as the following 
product: 

P(t x) = 4 , • .• 4 ,, 

where the coefficient tx^ corresponds to the probability of having the letter xj ap¬ 
pearing either on its single coordinate if Xj = og or xj =04, or on both coordinates 
to which it belongs otherwise. Hence: 

tao=Piiao) toi =Pi(ai)-^2(01) ia2 =^2(02)-^3(02) 

taa = Psiaa) ■ Piias) ta^ = P4(a4)- 

For instance, if the local probabilities pi were uniform, pi = (l /2 1 / 2 ), we would 

have: 

_ 1 _ 1 _ 1 _ 1 _ 1 

“ 2 ^“1 ~ 4 ^“2 “ ^ ^0.3 — ^ la 4 — 2' 

This choice, which may seem natural at first, would actually produce a random heap 
with a bias, namely it would favor the appearance of pieces og and 04. Since ta„ = 1/2 
and tai = 1/4 in this case, it would produce on average twice more occurrences of og 
than occurrences of oi. 

However, for uniform generation purposes, it is desirable to obtain all coefficients 
tai equal. Can we tune the initial probability distributions Pi in order to achieve this 
result? Introduce the Mobius polynomial p,{z) = 1 — 5 z -I- , and consider its 

root of smallest modulus, namely gg « 0 . 308 . It turns out that the only way to have 
all coefficients ta^ equal is to make them precisely equal to gg. In turn, this imposes 
conditions on the local probability distributions pi,p2,P3jP4 with only one solution, 
yielding for this example: 

Pi{ao) = qo~ 0.308 - 
P2iai) = « 0.445 

P 3 (a 2 ) = ’Sl/ « 0.555 

^4(03) = 1 - go « 0.692 

This array of positive numbers has the sought property that the product of any two 
numbers along the depicted diagonals equals gg, and the sum of every line equals 1; 
and go is the only real allowing this property for an array of this size. Running 
the PSA with these values for the local probability distributions produces a growing 
random heap which is uniform, in a precise meaning that will be formalized later in 
the paper. 

2.2 — Illustrating the Probabilistic Synchronization Algorithm (2) 

Consider the following network of alphabets with the four letters Oo, Oi, 02, 03: 

Si = {ao,ai}, ^2 = {01,02}, £3 = 102,03}, £4 = {03,00}. 


•+•=1 


;Pi(oi) = 1 - go « 0.692 


•X'= 9 o, 


■ + ■=1 


;P2(o2) = -jd 


_ 1 —2go 


Qo 


0.555 


■X-=qo 


• + •=1 


•x-=qo_ 


•+•=1 


;P3(a3) = T^« 0.445 


- 4 ) 4 ( 04 ) = go « 0.308 
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Figure 3 : Synchronization graph of the ring model with four generators 
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Figure 4 : Output heap resulting from an instance of the PSA for the ring model with 
four generators 


Analogously to the previous example, the synchronization graph for this network 
is depicted in Figure Let us try to apply the same generation technique as in 
the previous example. For each alphabet we consider a random sequence Yi of 
letters of this alphabet. Given the symmetry of the network of alphabets, in order to 
obtain a uniform distribution we need to consider this time uniform local distributions 
Pi = (l /2 1 / 2 ) for * = 1 , 2 , 3 , 4 . This yields for instance: 

Yi = oouiaioo ■ ■ • ^2 = 01020201 ■ • ■ F3 = 03020302 ... Y4 = 00030903 ... 


The construction of the stacking vector Y with the matching of letters yields: 



Y = 


Ol 

Ol 

03 


02 

I 

02 







the two occurrences of oq being connected. 


The corresponding heap is depicted on Figure It is then impossible to extend 
this heap while still respecting the sequences Yi, Y2, F3,14; not because some letter is 
waiting for its matching, but because the different letters already present are blocking 
each other, creating a cycle that prevents to interpret the rest of the vector as an 
extension of the heap of Figurej^ Such a cycle will appear with probability 1 at some 
point, whatever the sequences Yi, Y2,Y3,Y4. Henceforth the PSA only outputs finite 
heaps for this model. 


2.3 — Illustrating the Probabilistic Full Synchronization Algorithm 

In the previous example, we have seen that the PSA outputs a finite heap with 
probability I, whereas we would like to obtain heaps arbitrary large. The Probabilistic 
Full Synchronization Algorithm (PFSA) is designed to solve this issue. We will now 
describe how to tune it, for the ring model with four generators introduced above and 
illustrated in Figure in order to produce infinite heaps uniformly distributed. 
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(i) 


(ii) reject 


(iii) 


(iv) accept 


Figure 5 : First iteration of the PFSA. (i) The finite output of the PSA on three 
generators Uq, 01,02- (ii) Adding the piece 03 on top of the obtained heap is not 
pyramidal since occurrences of Oi can be removed without moving 03. (iii) Another 
instance of the heap on three generators 00,01,02- (iv) This time, the new heap 
^1 obtained by adding 03 on top of is pyramidaL 


First, the theory of Bernoulli measures for trace monoids (see below in Section]^ 
tells us that each piece of the monoid must be given the probabilistic weight qi , root of 
smallest modulus of the Mobius polynomial fi{z) = l — 4 z + 2 z‘^, hence qi = 1 —•\/ 2 / 2 - 
Second, we arbitrarily choose a piece, say 03, to be removed- We are left with the 
following network of alphabets: S'l = {ao,ai}, S2 = {01,02}, S3 = {02}, S4 = (aoj- 
From the heaps point of view, this is equivalent to the network with only two alphabets 
(S'i,S2), and sharing thus the only letter oi- We will now design a variant of the 
PSA algorithm, to be applied to the network (E},S2), outputting finite heaps with 
probability 1, and attributing the probabilistic parameter qi to each of the three 
pieces 00,01,02- This is possible by considering sub-probability distributions p},P2 
associated to S(,E2 instead of probability distributions for the local generations- A 
possible choice, although not the unique one, is the following: 

p'i(ao) = gi = 1 - V2/2 -p'i(oi) = 1 - gi = V2/2 

P2(ai) = 1^37 = - 1 —-^^^^ 2 ( 02 ) = <71 = 1 - 72/2 

Note that the first line sums up to 1 whereas the second line sums up to less than 1, 
and this guaranties that the PSA executed with these values will output a finite heap 
and stop with probability 1- 

Let be the output of the PSA on (E'ljE^) with the above parameters- It is a 
finite heap built with occurrences of Og, Oi and only- Then, we add the piece 03 on 
top of and we check whether the obtained heap is pyramidal, which means that 
no piece can be removed from it without moving the last piece 03 (see Figure]^- If it 
is not pyramidal, we reject it, and re-run the PSA, producing another instance of 
until ■ 03 is pyramidal- At the end of this process, we obtain a pyramidal heap 
Cl = 7 ■ built with the four generators ag, ui, 02, 03, and a unique occurrence of 03- 
The process of computing this heap Ci is the first loop of the PFSA- We initialize 
what will be the final output of the algorithm by setting yi = Ci - We then reproduce 
the above random procedure, yielding a pyramidal heap C2 and then we form the heap 
y2 = yi ■ f,2, obtained by simply piling up C2 on top of yi. We iterate this procedure: 
outputting Ca pyramidal, we put 1/3 = 1/2 • Csj and so on- Then the increasing heaps 
?/i, ?/2, • - - approximate an infinite heap on the four generators Og, Oi, 02,03 which we 
claim to be uniformly distributed- 
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3—Preliminaries on trace monoids 

An alphabet is a finite set, usually denoted by E, the elements of which are called 
letters. The free monoid generated by E is denoted by E* . 


3.1 — Combinatorics of trace monoids 


10 . The unit 


3 . 1.1 Definitions — An independence pair on the alphabet E is a binary, irreflexive 
and symmetric relation on E, denoted I. The trace monoid M. = Ad(E,/) is the 
presented monoid A 4 = {T, \ ab = ba for all (a, 6) € I). Hence, if TZ is the smallest 
congruence on E* that contains all pairs {ab, ba) for (o, b) ranging over /, then Ai 
is the quotient monoid Ai = YA/TZ. Elements of Ai are called traces 
element is denoted by e, and the concatenation in Ai is denoted by 

The length |a::| of a trace x £ Ai is the length of any word in the equivalence 
class X. The left divisibility relation on Ai is denoted by <, it is defined by x < 
y 3 z & Ai y = X ■ z. 

3 . 1.2 Cliques — A clique of Ai is any element x G Ai of the form x = ai ■ ... • On 

with Oi G Y and such that G I for all i ^ j. Cliques thus defined are in 

bijection with the cliques, in the graph-theoretic sense, of the pair (E, I) seen as an 
undirected graph, that is to say, with the set of complete sub-graphs of (E,/). 

The set of cliques is denoted by The unit element is a clique, called the empty 
clique. The set of non empty cliques is denoted by £. 

For instance, for Ai = (09,01,02,03,04 | OiOj = ajOi for \i — j\ > 1 ), we have 
^ = {e, oo, oi, 02, 03, 04, 0902, 09O3, 0904, 01O3, 01O4, 0204, 09O2O4}. We call 
this trace monoid the path model with five generators, it corresponds to the example 
introduced in Section [O] Note that the synchronization graph depicted in Figure 
is not (E, I) but its complementary. 

3 . 1.3 Growth series and Mobius polynomial — The growth series of Ai is the formal 
series 


ZM(t)= ^ 
xeM 

The Mobius polynomial is the polynomial fiM (t) defined by 

cG'^ 

For the path model with five generators introduced above, we have = 

l-bt + et"^ -C. 

The Mobius polynomial is the formal inverse of the growth series [6 

Zm (t) = i/y-M (t) ■ 

The Mobius polynomial has a unique root of smallest modulus, say po . This root 
is real and lies in (0,1], and coincides with the radius of convergence of the power 
series Z^it) [l^fTd] . 

3 . 1.4 Multivariate Mobius polynomial — The Mobius polynomial has a multivari¬ 
ate version, yj^{ti,.. ., 1 ^) where ti,... are formal variables associated with the 
generators Oi,..., a^r of Af. It is defined by: 

yAi (^ 1 ; • ■ • ; I'N ) ^ ^ ( 1 )^ ‘ ' i'jc 7 
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where the variables tj^,..., tj^ correspond to the letters such that c = . 






acbdc 


cadbc 


a-c-b-d-c = c- a- d- b- c 



c 




d 

b 



c 


a 



(i) 



c 


b 





d 

a 





c 


(ii) 



Figure 6: (i)-(ii): representation of the two congruent words acbdc and cadbc in 
{a,b,c,d I ac = ca, ad = da, bd = db). (iii): the resulting trace a ■ c ■ b • d ■ c, 
represented as a heap of pieces, (iv): the associated labeled ordered poset. For 
instance a < d does not hold since acbdc —)■ cabdc —> cadbc —>■ cdabc. 


3 . 1.5 Irreducibility — The dependence relation associated with M = is 

the binary, symmetric and reflexive relation I? = (S x E) \ /. The trace monoid Ai 
is irreducible if the pair (E, D) is connected as a non oriented graph. 

If M is irreducible, then the root po of M is simple . 


3.2 — The heap of pieces interpretation of traces 


3 . 2.1 The picture — Viennot’s theory provides a visualization of traces as heaps 
of pieces 22 . Picture each trace as the piling of dominoes labeled by the letters of 
the alphabet, and such that dominoes associated with two letters a and b fall to 
the ground according to parallel lanes, and disjoint if and only if (a, b) € I. See an 
illustration on Figure]^ (i)-(iii), for A 4 = {a, b,c,d \ ac = ca, ad = da, bd = db). 

3 . 2.2 Traces as labeled ordered sets — The formalization of this picture is done by 
interpreting each trace a; of a trace monoid as an equivalence class, up to isomorphism, 
of a E-labeled partial order, which we describe now. 

Let a; be a trace, written as a product of letters x = oi •... • a„ . Let x = { 1 ,..., n}, 
and define a labeling ^ : x —>■ E by 4 >{i) = ai. Equip x with the natural ordering 
on integers <. Then remove the pair {i,j) from the strict ordering relation < on 
X whenever, by a finite number of adjacent commutations of distinct letters, the 
sequence (oi,..., a„) can be transformed into a sequence (acr(i),. ■., ao'(n)) such that 

o’W > ^{j)- 

The set of remaining ordered pairs (i,j) is a partial ordering on x, which only 
depends on x, and which defines the heap associated with x. See Figure]^ (iv). 


3.3 — Completing trace monoids with infinite traces 

3 . 3.1 Infinite traces — Let (x„)„>i and (j/n)n>i be two nondecreasing sequences 
of traces: x„ < x„+i and yn < yn+i for all integers n > 1 . We identify (x„)„>i and 
{yn)n>i whenever they satisfy: 

(Vn > 1 3 m >1 x„ < ym) A (Vn > 1 3 m >1 ?/n < Xm) ■ 

This identification is an equivalence relation between nondecreasing sequences. 
The quotient set is denoted by AI. It is naturally equipped with a partial ordering, 
such that the mapping AI —>■ AI associating the (equivalence class of the) constant 
sequence x„ = x to any trace x G AI, is an embedding of partial orders. 
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We identify thus A4 with its image in M through the embedding M. ^ M.. The 
set dM. of infinite traces is defined by: 

dM=M\M. 

The set dM is called the boundary of [^. Visually, elements of dM corre¬ 
spond to infinite countable heaps, that is to say, limiting heaps obtained by piling up 
countable many pieces. 

3 . 3.2 Properties of {M,<) — The partially ordered set {M,<) is complete with 
respect to least upper bound of nondecreasing sequences. And any element ^ € A 1 is 
the least upper bound of a nondecreasing sequence of elements of Af [^ . 

3.4 — Synchronization of sequences 

In this section we introduce an alternative way of looking at trace monoids, by means 
of vectors of words. This emphasizes the distributed point of view on trace monoids. 

3 . 4.1 Network of alphabets and synchronizing trace monoid. — A family (Si,..., Sjv) 
of N alphabets, not necessarily disjoint, is called a network of alphabets. Let S = 
El U... U Ejv . Define i? = {l,...,iV}as the set of resources, and consider the product 
monoid 


H = (El)* X ... X (Eat)*. 


( 1 ) 


Identify each a G E with the element H{a) = [ai)i^R of PL defined by: 


Vi G i? 


e (the empty word), 
a. 


if a ^ Ei, 
if a G Ei. 


For each letter a G E, the set of resources associated with a is the subset R{a) 
defined by 


R{a) = {i £ R : a G E^} . 

Let be the sub-monoid of R generated by the collection {H{a) : a G E}. 

Then is isomorphic with the trace monoid M = A 1 (E, I), where the independence 
relation / on E is defined by 

(a, b)£l ^ R{a) n R{b) = 0 . ( 2 ) 

The monoid M is called the synchronization trace monoid, or simply the synchro¬ 
nization monoid, of the network (Ei ,..., Eat). Examples have been given in Section]^ 
It can be proved that every trace monoid is isomorphic to a synchronization trace 
monoid [^. 

3 . 4.2 Synchronization of sequences. — Let (Ei,..., E^v) be a network of alphabets. 
Each monoid (E^)* being equipped with the prefix ordering, we equip the product 
monoid R defined in Q with the product order, and the sub-monoid ^ with the 
induced order. 

Assume given a vector of sequences Y = (Yi,..., Fat) G R. The synchronization 
trace of Y is defined as the following least upper bound in : 

X = \J{Z£L^ : Z<Y}. ( 3 ) 

This least upper bound does indeed exist in it is obtained by taking least upper 
bounds component-wise, which are all well defined. 
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Identifying ^ with the trace monoid AI defined in (H and above, the element X 
is thus the largest trace among those tuples below Y. 

For example, consider the ring model with four generators introduced in Sec¬ 
tion 2.2 with Si = {ao,ai}, S2 = {01,02}, S3 = {02,03} and S4 = {03,00}, and 


the vector: 


Y = 


/OoOiOiOoX 
01020201 

\00030003y 


One easily convinces oneself that the synchronization of Y is the following: 

( OqOi^ 

corresponding to oq • oi • 03 • 02 in the trace monoid. 


X = 


O1O2 

03O2 

\a0a3J 


We shall see below an algorithmic way to determine the synchronization of any given 
vector Y gH. 

3 . 4.3 On-line computation features. — The algorithmic computation of the syn¬ 
chronization trace of a given vector of sequences might belong to the folklore of 
concurrency theory, see for instance |^. In what follows, an special emphasis is given 
to the distributed and on-line nature of this computation. 

The remaining of this section is devoted to provide an algorithm taking a vector 
of sequences Y G % ss input, and outputting the synchronization trace of Y. Fur¬ 
thermore, the input trace Y might not be entirely known at the time of computation. 
Instead, we assume that only a sub-trace Y' <Y feeds the algorithm, together with 
the knowledge whether some further input is about to come or not. Therefore, we 
need our algorithm to produce the two following outputs: 


1 . The best approximation X' of the synchronization trace X, given only the 
input F', which is merely the synchronization trace of Y'. 


2. And one of the following tags: 


DL for “Deadlock” if some more input is about to come, and yet the algo¬ 
rithm has already reached the synchronization trace of Y, i.e., \i X' = X 
whatever the continuation of Y'. 

WFI for “Waiting for Input” if some more input is about to come, and the trace 
X' might be extended. 

EOF for “End of file” if there is no more input to come. 


3 . 4.4 Detecting deadlocks — Consider, as in Sections 3 . 4.1 — 3 . 4.2 a network (Si,..., 
of alphabets. Given a finite word Yi G (S^)*, we denote by Yi the word Yi =Yi ■ fj, 
where fj is an additional symbol which can be either EOF of WFI, to be interpreted 
as “no more input will ever come” if f^ = EOF, or as “some more input might arrive” 
if fj = WFI. Finally, let F = {Yi,... ,Y 

We first introduce a basic routine described in pseudo-code below (Algorithm [^ . 
The input of the routine is the vector F. Its output is as follows: 


1 . If the synchronization trace X of F is non empty, then the routine outputs a 
minimal piece of X. 

2 . If the synchronization trace A of F is empty, then the routine outputs a flag | 
explaining why X is empty: 


Sat) 
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a) If F = (eof, ... ,eof), then | = EOF. Note that this necessarily entails 
that V is empty. 

b) In case at least one input component of V does not carry the symbol 
tj = EOF, then: 

i. If adding some letters to some of the components of Y carrying f^ = 
WFI could yield a non empty synchronization trace, then | = WFI. 

ii. If no letter can be added to the components of Y carrying f^ = WFI 
to yield a non empty synchronization trace, then | = dl. 


To detect if a piece u is minimal in a heap, when the heap is given by its rep¬ 
resentation in it is necessary and sufficient to check that it is minimal in all the 
coordinates where it occurs, that is to say, in all the components belonging to the 
set of resources TZ(u). This justifies that Algorithm is sound. Observe that the 
algorithm is non deterministic, as there may be several minimal pieces. 

Algorithm executes in constant time, the constant growing linearly with the 
number of edges in the dependence graph {T,,D) of the associated trace monoid. If 
communicating processes are devised, one per each alphabet, and able to write to a 
common register, they will be able to perform Algorithm in a distributed way. We 
shall not work out the details of the distributed implementation, since it would be 
both outside the scope of the paper and out of the range of expertise of the author. 
People from the distributed algorithms community will probably find it routine. 

3.4.5 Computation of the synchronization trace. — With Algorithm at hand, we 
can now give an algorithm to compute the synchronization trace of a given vector of 
sequences Y This is the topic of the Synchronization Algorithm (Algorith m , 
of which we give the pseudo-code below, and which takes as input a vector Y = 
{Yi,... ,Y n) of the same kind as Algorithm 

The Synchronization Algorithm iteratively executes Algorithm and collects the 
minimal pieces thus obtained to form it own output. If T = (Yi,..., Yat) € % and if 
M <Y, then we denote hy Z = M\Y the unique vector Z such that M ■ Z = Y. 

As in the requirements stated above in Section 3 . 4.3 the Synchronization Algo¬ 
rithm outputs both the synchronization trace of its input and a tag advertising if 
the computation is over, either because a deadlock has been reached or because the 
input feed is over, or if it is still waiting for some input that might cause the syn¬ 
chronization trace computed so far to be extended. This feature will be crucial when 
considering its execution on sequences of letters which are possibly infinite, see below 
in Section 13 . 4.61 

The Synchronization Algorithm can be executed in a distributed way by N com¬ 
municating processes, one for each coordinate, and it runs in time linear with the size 
of the synchronization trace X. 

3.4.6 Feeding the Synchronization Algorithm with a possibly infinite input — We 
have defined the synchronization trace of a vector Y G H in Section | 3 . 4.2 The same 
definition applies if one or several components of Y are infinite, by putting: 


X = \J{Zg^ : Z<Y}, 


( 4 ) 


and this least upper bound is always well defined in Ai since least u pper b ounds of 


nondecreasing sequences always exist in (AI,<) as recalled in Section 


3 . 3.2 


We call 


the element A of A 4 thus defined the generalized synchronization trace of Y. 

Adapting the algorithmic point of view to an infinite input introduces obviously 
some issues, which can be addressed by observing that the output of the Synchroniza¬ 
tion Algorithm is nondecreasing with its input. We will idealize the situation where 
the Synchronization Algorithm is repeatedly fed with a nondecreasing input by saying 
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Algorithm 1 Determines a minimal piece of the synchronizing trace X oi Y = 


Require: Yi, ... ,Yjs[ 

1 ; for alH € {1,. .., N} do 
Ui ^ first letter of Yi 

3: end for 

4: ff ^ {i : Ui ^ EOF} 

5: H ^ {i : Ui = EOF} 

6: if R = 0 then 
return EOF 
end if 

9: A ^ e iJ : Ui ^ WFl} 

if A = 0 then 
return WFI 
end if 

for alH G A do 

if TZ{ui) n H then 


10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 

35 

36 

37 

38 

39 

40 

41 


t> Recall that Yi = Yi ■ 

> Ui is either a real letter or f ^ 

> The set of indices of interest 

> The complementary set of H 
t> Case | 2 ^ of the above discussion 

> The set of indices with real letters 


> One instance of Case 2 (b)i 


Mi ^ DL 


else 


if Uj = Ui for all j G TZ{ui) then 


> No chance to obtain later the 
expected synchronization for Ui 

> Case where Ui is minimal 
in all the expected components of Y 


> The piece Ui is not minimal in X 


Mi ^ Ui 

else 

if Uj G {uijWFi} for all j G TZ{ui) then t> Synchronization 

for Ui is possible in the future 

Ml •(— WFI 

else 

Ml ^ DL 

end if 
end if 
end if 
end for 

if Mi = DL for all i G A then > Case 2 (b)ii the synchronization 

trace X is empty since it has no minimal piece 

return DL 
else 

if Mi = WFI for alH G A then > Case 2 (b)i (again) 

return WFI 

else > Case[T] 

return one Mi with Mi ^ {dl,wfi} > Any Mi ^ {dl,wfi} is minimal 

in the synchronization trace X 

end if 
end if 
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Algorithm 2 Synchronization Algorithm: computes the synchronization trace of Y 

Require: r = (Fi,..., Fat) [>Fi = F,-f, 

1: X <— e > Initialize the variable X with the empty heap 

2: call Algorithm with input F 
3: M ^ output of Algorithm 
4; while M ^ {dl,wfi,eof} do 
5: X ^ X -M 

6 : F ^ M\Y 

7; call Algorithm with input F 

8 : M -(r- output of Algorithmic 

9: end while 
10; return (A, M) 


that it is fed with a vector with possibly infinite components. The output, possibly 
infinite, is defined as the least upper bound in M. of the nondecreasing sequence of 
finite output heaps. It coincides of course with the generalized synchronization trace 
X defined in Q. 

To describe more precisely this algorithmic procedure, consider a vector F of 
sequences, some of which may be infinite. We assume that F is effectively given 
through some sampling, i.e., as an infinite concatenation of finite vectors Zk S 'H: 

Y = Zi-Z2-Zs--- 

If a component of F is finite, the corresponding component of the vector Zk will be 
the empty word for k large enough. 

Furthermore, we assume that a primitive is able to produce, for each integer fc > 1 , 
a vector Zk = Zk ■ ffe, where ]k is itself a vector = (f^, with ]k,i ^ {eof,wfi}, 
in such a way that an occurrence of EOF marks the finiteness of the corresponding 
component of F. Formally, we assume that the following two properties hold, for all 
components: 


{Yi is a finite sequence) {3k > 1 f ^ = eof) 

and Vfc > 1 tfc j = EOF (Vfc' > k ffe'.i = eof) 

The Generalized ^nchronization Algorithm, the pseudo-code of which is given 
below in Algorithm^ is then recursively fed with Zi, Z2, ■ ■ ■, writing out to its 
output register X. The output of Algorithm]^ is the least upper bound, in At, of the 
sequence of heaps that recursively appear in the register X. 

The Generalized Synchronization Algorithm exits its while loop if and only if the 
generalized synchronization trace is finite. In all cases, its output (as defined above) is 
the generalized synchronization trace of the vector F, regardless of the decomposition 
{Zk)k>i that feeds its input. 

4—Preliminaries on probabilistic trace monoids 

We denote by the set of positive reals. 

4.1 — Bernoulli and finite Bernoulli sequences 

In this section we collect classical material found in many textbooks [^. We pay a 
special attention to presenting this material so as to prepare for its generalization to 
trace monoids. 
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Algorithm 3 Generalized Synchronization Algorithm 

Require: (Zfe)/j>i > The recursively feed the input 

1 ; A •<— e [> Initialize the variable X with the empty heap 

2: k ^ 1 

3 ; call Algorithm with input Zi 

4: ([/, t) •«— output of Algorithm]^ i> t G {eof, dl,WFi} 

5: while t = WFI do 

6 : X^X-U 

7 ; /c ^ fc + 1 

8 : call Algorithm with input Zk 

9: (G, t) ^ output of Algorithmic 

10; end while 


4.1.1 Bernoulli sequences — Classically, a Bernoulli sequence on an alphabet S is an 
infinite sequence (A„)„>i of independent and identically distributed (i.i.d.) random 
variables, where each Xi takes its values in S. In order to eliminate degenerated 
cases, we assume that the common probability distribution, say p, over S of all Xi is 
positive on S ; hence p is bound to satisfy: 

Va e E Pa > 0 , Pa = 1 ■ 

oGE 

4.1.2 Bernoulli measures — The canonical probability space associated with the 
Bernoulli sequence {Xn)n>i is the triple (dS*,5',P) defined as follows: the set dZ,* 
is the set of infinite sequences with values in E. The a-algebra ^ is the tr-algebra 
generated by the countable collection of elementary cylinders f x, for x ranging over 
the free monoid E*, and defined by 

t X = {w S 5E* : X < uj} , 

where x < to means that the infinite sequence w starts with the finite word x. Finally, 
P is the unique probability measure on {dZ*,^) which takes the following values on 
elementary cylinders: 


VxSE* P(tx)=/(x); 

here / : E* —)■ is the unique positive function satisfying: 

Va e E /(a) = Pa, yx,y G Z* f{xy) = f{x)f{y ), (5) 

where xy denotes the concatenation of words x and y. Bernoulli sequences correspond 
exactly to probability measures P on {dZ* with the following property: 

VxSE* P(tx)>0, 

Vx,yeE* P(t(xz/)) =P(tx)P(ty). 

Such probability measures on (dE*,^^) are called Bernoulli measures. 

4.1.3 Uniform Bernoulli measure — Among Bernoulli measures associated with the 
alphabet E, one and only one is uniform, in the following sense: 

Vx.yGE* |a:| = I 2 /I ^ P(tx) =P(tj/), 

where |x| denotes the length of the word x. It is characterized by P(t x) = , 

where po = 1 / 1 SI ■ 
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4.1.4 Finite Bernoulli sequences — Let p = {pa)a£Y. be a sub-probability distribu¬ 
tion over E, hence bound to satisfy: 

Va e E Pa > 0 , Pa < 1 ■ 

oGE 

To each sub-probability distribution p we associate the function / : E* —>■ K 
defined as in ([^, and also the following quantities: 

e = l- ^Pa, = 

aGS aiGS’' 

Finally we define the sub-Bernoulli measure Vp as the probability distribution over 
the countable set E*, equipped with the discrete cr-algebra, and defined by: 

Vx G E* = ^fix). 

A finite Bernoulli sequence is the random sequence of letters that compose a word 
a; G E*, drawn at random according to the probability measure Vp on E* . 

An effective way to produce a finite Bernoulli sequence according to a sub-probability 
distribution p = {pi)i is the following. Consider a stopping symbol EOF, and extend p 
to a probability distribution by setting p(eof) = 1 — pi . Then output a Bernoulli 

sequence according to p, until the symbol EOF first occurs. The letters before the 
first occurrence of EOF form the sought finite sequence. 

4.1.5 Full elementary cylinders — It is convenient to consider the following com¬ 
pletion of E*: 

E* = E* U aE* . 

Hence both Bernoulli measures and sub-Bernoulli measures are now defined on 
the same space E*. For each word a: G E*, we define the full elementary cylinder "fl" x 
as follows: 


tl- a; = {^ G E* : a: < ^} . 

Here, x < ^ has the same meaning as above if ^ is an infinite sequence; and it means 
that a: is a prefix of ^ if ^ is a finite word. We gather the description of both Bernoulli 
and sub-Bernoulli measures in the following result. 

• Theorem 4.1— Let ¥ be a probability measure on E* = E* U 9E*. Assume that the 
function / : E* —>■ K defined by f{x) = P( 'ff x) is positive multiplicative, that is to 
say, satisfies: 

Vx G E* /(x) > 0 
Vx, 2 /GE* f{xy) = f{x)f{y). 

Define p = (pa)aGS and s by: 

Va G E pa = f{a) , £ = 1 - Pa ■ 

aGS 

Then one and only one of the two following possibilities occurs: 

1. £ = 0. In this case, P is concentrated on i9E*, and characterized by: 

VxGE* P(tx)=P(iyx) = /(x). 

The series X]a;GE* /(^) divergent, and V is a Bernoulli measure. 
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2. £ > 0. In this case, P is concentrated on E*. The series Z = /(^) 

convergent, and satisfies: 

e = ^ , Va; e E* P({a;}) = e/(x). 

The measure P is a sub-Bernoulli measure. 


4.2 — Bernoulli and sub-Bernoulli measures on trace monoids 

The notions of Bernoulli measure and of sub-Bernoulli measure extend to trace 
monoids the notions of Bernoulli sequences and of finite Bernoulli sequences. They 
provide a theoretical ground for concurrency probabilistic models, in the framework 
of trace monoids. 

4.2.1 Valuations — Let be equipped with the monoid structure (K+, x, 1). A 
valuation on a trace monoid M = Af(E,/) is a morphism of monoids / : Af —> KlJ.. 
It is thus a function f : M ^ satisfying: 

/(e) = 1, Wx,yGM f{x-y) = f{x)f{y). 

4.2.2 Mobius transform; Mobius and sub-Mobius valuations — Let / : —>■ M be 

a valuation. The Mobius transform of / is the function h : ^ M defined by: 

h{c) = ^ only defined on cliques. ( 6 ) 

c'G*” : c'>c 

An alternative expression for the Mobius transform is the following. For each 
clique c let M. be the sub-trace monoid generated by those letters a S E such 
that a II c. Here, a || c reads as “a parallel to c”, and means that (a, &) G I for all 
letters b that occur in the clique c. Then: 


He) = f{c) •/i;K(c)(tl,...,tAr), 


with ti = f{ai), 


( 7 ) 


where y-MG) denotes the multivariate Mobius polynomial (see Section 3.1.4) of the 
trace monoid . By convention, the expression /r^(c) (ti,..., tj^j) actually involves 
only the variables U associated with those generators belonging to 

In particular, h{e) = ... ,tAr)—that was already observable on @. 

A valuation / is said to be [^: 


1. A Mobius valuation if: 


h{e) = 0 , Vc G £ h(c) > 0 . (8) 

2. A sub-Mobius valuation if: 

h(e)>0, Vc G £ h(c) > 0 . (9) 

Equivalently, as seen from Q, if E = {oi,..., on} and if we put ti = f{ai) for 
z G {1,..., N}, then / is: 

1. A Mobius valuation if: 


yMih, • ■ •, tN) = 0 , 

Vc G £ {ti, ■ 

■ • , tN) > 0 . 

( 10 ) 

2. A sub-Mobius valuation if: 

Mat (f^ij • ■ ■ j In) > 0 , 

Vc G £ /z^(c) (ti,. 

• • , tN) > 0 . 

( 11 ) 
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4.2.3 Cylinders and a- alqeh ras on JH and on dM. — Recall that we have introduced 
infinite traces in Section 3.3 yielding the completion A4 = A4 U 9A4. 

To each trace x G Ai, we associate the elementary cylinder ^ x C dM. and the 
full elementary cylinder 1*1" cc C A4, defined as follows: 




f X = {f G dM : X <^} = ^1 xCdM . 


The set M is equipped with the cr-algebra ^ generated by the collection of full 
elementary cylinders, and the set dM is equipped with the cr-algebra 5" induced by ^ 
on dM. Both tr-algebras are Borel cr-algebras for compact and metrisable topologies. 
The restriction of to A4 is the discrete cr-algebra. 

4.2.4 Multiplicative probability measures — In order to state an analogous result 
to Theorem |4.1| for trace monoids, we introduce the following definition, borrowed 
from [^. 

• Definition 4.2— A probability measure P on (AI,5') is said to be multiplicative when¬ 
ever it satisfies the following property: 

Vx G M P(t|- x) > 0 , 

Vx, y G M P( 'll (x • ?/)) = P( 'll x) • P( 2 /). 

We define the valuation / : AI —>■ K associated with P and the number e by: 

\/x G M /(x) = P( it x), £ = h{e) , 

where /i : ^ K is the Mobius transform of f. 

The relationship between multiplicative measures and Mobius and sub-Mobius 
valuations is as follows . 

• Theorem 4.3— There is a bijective correspondence between multiplicative measures P 
and valuations which are either Mobius or sub-Mobius. The alternative is the follow¬ 
ing: 

1. £ = 0. The valuation is Mobius. In this case, P is concentrated on dM and 
characterized by: 

Vx G M P( t x) = /(x). 

The series divergent. We say that P is a Bernoulli measure. 

2. e > 0. The valuation is sub-Mobius. In this case, P is concentrated on M and 
satisfies: 


Vx G M P({x}) = £/(x). 

The series Z = convergent and satisfies Z = 1/e. We say that 

P is a sub-Bernoulli measure. 


4.2.5 Uniform multiplicative measures — A valuation / : AI —> K is said to be 
uniform if /(a) is constant, for a ranging over E. This is equivalent to saying that 
/(x) only depends on the length of x, and also equivalent to saying that /(x) = p^^^ 
for some real p. 

A multiplicative measure is uniform if the associated valuation is uniform. The 
following result describes uniform multiplicative measures ill- They are related 
to the root of smallest modulus of the Mobius polynomial of the trace monoid (see 
Section|3.1.3|). 
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• Theorem 4.4— Uniform multiplicative measures P on a trace monoid M. are in bi- 
jection with the half closed interval (0,po]; where po is the root of smallest mod¬ 
ulus of fj,j ^4 . The correspondence associates with P the unique real p such that 
P(a;) = for all x G M. 

The alternative is the following: 

1. p = Po . In this case, P is Bernoulli (concentrated on the boundary). 

2. p < Po ■ In this case, P is sub-Bernoulli (concentrated on the monoid). 

4.2.6 Extension and restriction of valuations — We shall need the following result 
for the construction of the PFSA in Section [131 

• Theorem 4.5— Let M = M{T,,I) be an irreducible trace monoid. 

1. Let / : A4 —> be a Mobius valuation, let T,' be any proper subset of E and let 

M' be the submonoid ofM generated by E'. Then the restriction f : M.' —> 

of f to M' is a sub-Mobius valuation. 

2. Let E' = E \ {a}, where a is any element of E, let j\4' be the submonoid of M 

generated by E', and let f : M' —>■ be a sub-Mobius valuation. Then there 

exists a unique Mobius valuation / : A4 —)■ that extends f on Ai'. 

Proof. Proof of point^ Let /, /' and E' be as in the statement. Let also h and h' 
denote the Mobius transforms of / and of /'. Let S be the series with nonnegative 
terms: 


s = 

We claim that this series is convergent. To prove it, recall from that a pair ( 7 , 7 ') 
of cliques is said to be in normal form, denoted by 7 —>■ 7 ', if V 6 S 7 ' 3a G 7 (a, b) ^ I. 
Any trace x G M can be uniquely written as a product a; = 71 •... • 7 ^ of cliques such 
that 7 i -G- 7 i_|_i holds for alH = 1 ,..., fc — 1 . 

Let the nonnegative matrices A = B = 

where £ denotes the set of nonempty cliques of Ad, be defined by: 

^7.7' = 1(t ^ -^0 • 1(7' G M ') ■ 1(7 7') • /(7'), S^,7' = 1(7 ^ 7') • fh ')- 


It follows from the existence and uniqueness of the normal form for traces, de¬ 
composing the traces x G AA according to the number of terms of their normal form, 
that the series S writes as: 

^ = 1 + ^ / • • J = 1 -h / • • J, 

fc>l k>l 


where / and J are row and column vectors filled with Is. We know by Lemma 6.4] 
that the matrix B has spectral radius 1, since / is assumed to be Mobius, and that 
it is a primitive matrix since Ai is irreducible. But A < B with Af^B. Therefore, it 
follows from the Perron-Frobenius Theorem 21 that A has spectral radius < 1 and 
thus that the series S is convergent. 

By the Mobius inversion formula 22 , it entails that the relation {Ax&M' /(^)) ' 
h'{e) = 1 holds in the fields of reals. Hence h'{e) > 0. It remains to prove that 
h'{5) > 0 also holds for any non empty clique 5 of AA' . This is a bit easier to prove. 
For any non empty clique 5 of AA' , let AA'^^'^ and AA^^'^ be the submonoids of AA' 
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and of A4 respectively, be defined as in Section 


4.2.2 


Then C hence the 


following inequalities between series with nonnegative terms hold: 




Therefore, as above, the equality h'(S)- /(^)) = ^ holds in the field of reals, 

proving that h'(S) > 0, which completes the proof that /' is a sub-Mobius valuation. 

Proof of point [11 Let a, E' and f' : Ai' —>■ be as in the statement. Let 
also h' : M the Mobius transform of /', where is the set of cliques of A4'. 

Assuming that a Mobius extension f : A4 —>■ M!!]_ of /' exists, we first prove its 
uniqueness. For each positive real t, let /* : A4 —)■ K be the valuation defined by 
ft{a) = t and ft{a) = f'{a) for a S E'. Then any valuation on M. extending f is 
of the form ft for some t. Let /ij : K denote the Mobius transform of ft- We 

evaluate ht{e) as follows: 


htie)= Yl (-l)'"'/*(7)+ E (-l)'"'/‘(7)- 

On the one hand, observing the equality of sets ‘^' = { 76 '^ : a ^ 7 }, we 
recognize h'{e) in the second sum. On the other hand, using the notation already 
introduced a || 7 , for 7 S to denote that 0^7 and o • 7 G the range of the 
first sum above is in bijection with the set of cliques 5 G such that a || i5, the 
bijection associating J with j = a-S. We then have (—l)'"’''/t(7) = (—t) • (— 
Henceforth: 


ht{e) = {-t) ■ K+ h'{e), with a: = 1 - AT' and AT'= Y 

SG^' : a||5A(57^e 

Let A be a random trace associated with the sub-Mobius valuation /'. We eval¬ 
uate the probability of the event {7 = {35 gE' : a|| 6 Aa<A}. The inclusion- 
exclusion principle yields: 

p(c/) = Y (-i)''^'+^P(A > 7 = a:', 

SG^' : ajj^Ai^T^e 

the later equality since P(A > (5) = f'{6) by definition of X. The event U has 
probability less than 1, otherwise all pieces of E' would be parallel to a, contradicting 
that A4 is irreducible. We deduce that AT' G [0, 1) and thus AT G (0,1]. In particular, 
if ft is Mobius, it entails that ht{e) = 0, and since we have just seen that K 0, it 
implies that t = h'(e)/K, proving the sought uniqueness. 

Let us now prove the existence of a Mobius extension /. Let denote as 

above, for any clique i5 G the sub-monoid of M generated by those letters 6 G E 
such that b || 6. For each 6 G we introduce the formal series: 

Gsit) = Y 

Now, let to be the radius of convergence of the power series Ge{t) = 

Since all the power series Gs{t) have non negative coefficients, they satisfy Gs{t) < 
Ge{t) < 00 for all t G [0,to)- In particular, the radius of convergence rs of Gg 
satisfies rg > to- Actually, reasoning as in the proof of point and invoking the 
Perron-Frobenius Theorem, we see that the strict inequality rg > to holds for all 
S e. Therefore, for all <5 yf e, the series Gg{to) is convergent, and thus the Mobius 
inversion formula yields the following equality in the held of reals: Gg(to) ■ K(S) = i. 
We conclude that htg(S) > 0 for all cliques <5 7 ^ e. Since htg(e) = 0, we conclude that 
ftg is the sought Mobius valuation extending /'. □ 
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5—The Probabilistic Synchronization Algorithm 

In this section we consider a network of alphabets sharing some common letters. 
We then wish to generate random traces of the synchronization monoid, in a dis¬ 
tributed and incremental way. The first idea that comes in mind is to generate local 
Bernoulli sequences, and to see what is the synchronization trace of these sequences. 
This constitutes the Probabilistic Synchronization Algorithm, which is thus a proba¬ 
bilistic variant of the Generalized Synchronization Algorithm which was described in 
Algorithm 

The random traces thus obtained can be either finite or infinite. In all cases 
their probability distribution is multiplicative, hence the theory of Bernoulli and sub- 
Bernoulli measures for trace monoids is the adequate tool for their study. After 
having established this rather easy result, we turn to specific examples. We obtain 
the non trivial result that for path models, any Bernoulli or sub-Bernoulli measure 
can be generated by this simple technique. 


5.1 — Description of the PSA 

Let (Si,..., Eiv) be a network of N alphabets with N > 2, let S = EiU.. .UEiv , and 


let A4 = AI(S,/) be the synchronization trace monoid, as described in Section 3.4.1 


Recall that we identify A4 with the sub-monoid C H defined in Section 3.4.1 and 
that indices in 1 ,..., are seen as resources. 

Assume that, to each resource i G {l,...,Af},is attached a device able to produce 
a Ej-Bernoulli sequences Yi, either finite or infinite, with a specified probability or sub¬ 
probability distribution pi on E^. As we have seen in Section [4.1.4[ the generation of 
such Bernoulli sequences is effective, together with the information that the sequence 
is over in case where pi is a sub-probability distribution. 

Henceforth, it is straightforward for each device to produce a sampling of Yi under 
the form Zi i ■ Zi 2 • •.and moreover to tag each sub-sequence ^ with an additional 
symbol fj € {EOF, WFi} in order to deliver a sequence {Zi^k)k>i with Zi^k = Zi^k-U,k 


as specified in Section 3.4.6 The symbol fj f. is given the value WFI until the device 


decides the sequence is over (if it ever does), after which the symbol fj is given the 
value EOF. 

The Probabilistic Synchronization Algorithm (PSA), the pseudo-code of which is 
given below in Algorithm consists in executing in parallel both the local gener¬ 
ation of the sequences Yi,... ,Y]\[, and the Generalized Synchronization Algorithm 
described previously in Algorithm]^ 


Algorithm 4 Probabilistic Synchronization Algorithm 

Require: pi,... ,pjv ■> Probability or sub-probability distributions 

1 ; while Algorithm 1^ does not exit do 

2: for all i G {!,..., N} do 

3: for all fc = 1, 2,... do 

4: generate the fc**' sampling ^ of a Bernoulli sequence according to pi 

5: feed Algorithm 1^ with ^ 

6; end for 

7; end for 

8 : end while 


5.2 — Analysis of the algorithm 


21 
















5.2.1 Distribution of PSA random traces — The trace y € A4, output of the 
execution of the PSA (Algorithm]^, is random. What is its distribution? 

• Theorem 5.1— The probability distribution P of the random trace produced by the 
PSA is a multiplicative probability measure on M.. The valuation / : A1 —^ K associ¬ 
ated with this measure by f{x) = P('fi x) is such that: 

VaGS /(a) = Pi(a), (12) 

i£R{a) 

where pi is the probability or sub-probability distribution on , and R(a) is the set 
of resources associated with a. 

Proof. For each letter a G E, let Qa be the real number defined by: 

9a = n ■ 

iGR{a) 

Let 2 / G Ad be the random trace produced by the PSA. Fix z £ M a trace, and 
let {zi,..., zn) be the representation of z in Then y > z holds if and only if, for 
every index i G {1,..., N}, the corresponding sequence Xi starts with Zi . 

Let z = ai ■ ... ■ Qj be any decomposition of z as a product of generators G E. 
Since each sequence Xi produced by the device number i is Bernoulli or sub-Bernoulli 
with distribution or sub-distribution pi , and since the sequences are mutually inde¬ 
pendent, we have thus: 


P(y > z) = 


n 


Quk ■ 


k=l 


In other words, if / : Al —)■ K is the valuation defined by (121, one has P('f|' z) = 
/(z). This shows that P is multiplicative. □ 


5.2.2 Small values — We shall see that not all multiplicative probabilities on Ad can 
be reached by this technique. However, as long as only “small values” are concerned, 
the PSA can reach any target multiplicative measure. Let us first introduce the 
following convention: if / : Ad —>■ K is a valuation, and if e is a real number, we 
denote by ef the valuation which values on generators are given by: 


Va G E (e/)(a) = ef{a). 


Hence the value of ef on arbitrary traces are given by: 

Vx G Ad {sf ){x)=e^^^f{x). 


The following result states that, at the expense of having small synchronizing 
traces, the relative frequency of letters in traces produced by the PSA suffers from no 
constraint. 

• Proposition 5.2 —Let (Ei,...,E7v) be N alphabets, let Ad = Ad(E, J) be the syn¬ 
chronization trace monoid, and let t : Ai ^ M. be a valuation on Ai. Then there 
exists £ > 0 and sub-probability distributions {pi)i<i<N , with pi a sub-probability dis¬ 
tribution on Yii, such that the valuation f characterizing the random trace y £ Ai 
produced by the PSA with respect to {pi)i<i<N , satisfies: 

f = £t. 
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Proof. For £ > 0 to be specified later, let {pi)i<i<N be the family of real valued 
functions, pi : Si —>■ M, defined by: 

Va G Si pi(a) = . 

For e > 0 small enough, all Pi are sub-probability distributions over Si. According 
to Th. |5.1[ the valuation / describing the distribution of the random trace produced 
by the PSA satisfies: 


/(a) = n ■ 

i^R{a) 


The proof is complete. □ 

5.2.3 Reduction to irreducible trace monoids — Assume that the trace monoid is not 
irreducible. Hence the dependence relation (S, D) has several connected components, 
let us say that it has two components (S'!, Di) and {S2, D2) to simplify the discussion. 

Let Ii and I2 be the dependence relations on S\ and S2 defined by: 

h = {Si xSi)r\i = {Si X Si) \Di, i2 = {S2 X ^2) n / = {S2 x A2) \ D2 . 

Putting Ail = Ai{Si,Ii) and Ai2 = Af (S' 2 ,/ 2 ), we have: 

Ai = AiixAi 2 , M=AiixAi 2 - 

It follows from Theorem |5.1| that the distribution P of the random trace y produced 
by the PSA is a tensor product Pi 0 P 2 ; probabilistically speaking, y is obtained as 
the concatenation of two independent traces yi G AIi and j /2 € Ai 2 ■ The probability 
measures Pi and P 2 , of yi and y2 respectively, are identical as those deriving from 
PSA algorithms restricted to the alphabets concerning Si and S2 respectively. 

In conclusion, the PSA algorithm decomposes as a product of sub-PSA algorithms 
on the irreducible components of the synchronization trace monoid. Hence there is 
no loss of generality in assuming, for the sake of analysis, that the synchronization 
trace monoid is irreducible. 


5.3 — Example: the path model 

The path model is close to the dimer model, a topic of numerous studies in Combi¬ 


natorics and in Statistical Physics 23, 15 . Here we shall see that the path model is 


a framework where the PSA works at its best, in the sense that any multiplicative 
measure on Ai can be obtained through the PSA. 

The path model is defined as the trace monoid Ai = AI(S, /) on A-|-1 generators: 
S = {ao,...,a 7 v}, where A > 0 is a fixed integer. The independence relation is 
defined by: 


I = {{oi.aj) : \i-j\ > 1 } . 

Then Ai is the synchronization monoid of the A-tuple of alphabets (Si,..., Sjv) 
with Si = {oi-i, Ui} for i G {1,..., N}. The set of resources of oi is: 

f{l}, ifi = 0, 

^K) = <{A}, = (13) 

[ {i, i -I- 1}, if 0 < i < N. 

Assume that each of the N alphabets Si,..., S^r is equipped with a positive prob¬ 
ability distribution, say pi,.. ■ ,pn ■ Then, with probability 1, all tuples {xi ,..., xn), 
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where Xi is a Bernoulli infinite sequence distributed according to Pi , are synchro¬ 
nizing. Indeed, each Xi contains infinitely many occurrences of a^-i and of ai , and 
this is enough to ensure the synchronization. Therefore, the PSA yields a multiplica¬ 
tive measure entirely supported by the set of infinite traces of Af, hence a Bernoulli 
measure on dM. 

Actually, the following result shows that every Bernoulli measure on the path 
model can be obtained through the execution of the PSA. 

• Theorem 5.3— Let P &e o Bernoulli measure on DM, where M is the synchroniza¬ 
tion monoid associated with the path model. For i S {0,..., A}, put: 


ti = P(t fli) • 


Then there exists a unique tuple (pi,... ,Pn), with Pi a positive probability distri¬ 
bution over Yii = {ai-i,ai}, such that the random infinite trace f G dM. produced by 
the PSA based on (pi,... ,pn) has the distribution probability P. 

The probability distributions pi,... ,pn are computed recursively by: 


pi(ao) = to 

i G {2,... ,iV} Pi(ai_i) = 


ti--] 


Pi-i(ai-i) 


Pi(ai) = l-to (14) 

Pi(ai) = 1-pi(ai_i). (15) 


Remark. The relations (14)-(15) are necessary conditions, almost immediate to es¬ 


tablish from the result of Theorem 15.II What is not obvious is that the numbers thus 
defined stay within (0,1), yielding indeed probability distributions (pi(ai_i) Pi(ai)) 
over Si, and that PAr(ajv) = tjv- 


Proof. Let / : A4 —>■ K be the valuation associated with P. According to Theorem |4.3[ 
/ is a Mobius valuation—we will use this fact in a moment. 

Proof of uniqueness of {pi,. .. ,pn) and proof of (14)-(15). With respect to 
a tuple (pi,... ,Pn) of positive probability distributions over (Ei,..., Sjv), the PSA 
produces a random infinite trace f G dM. Let p : A4 —>■ M be the valuation associated 
with the probability distribution of f. Then, according to Theorem |5.H one has: 

ViG{0, ...,A} gia,) = Pr(a*)- 

rGR{ai) 


Referring to (13), / = p is equivalent to: 


0 <i < N 


to =Pi(ao) 
tN = PN{aN) 

ti = pi{ai)pi+i{ai) 


(16) 

(17) 

(18) 


It follows at once that the tuple (pi,... ,pAr) inducing P through the PSA, if it 
exists, is unique and satisfies necessarily the recurrence relations (14|-(15). 

Proof of existence o/ (pi,... ,pn)- Instead of starting from the recurrence rela¬ 
tions (14|-(15), we use a different formulation. For i G {—1, •. ■ ,N}, let A4o,i be the 
sub-trace monoid of M generated by {oq, ..., a^}. Let also po.i be the evaluation on 


24 












(^ 0 , ■ ■. ,ti) of the multivariate Mobius polynomial of A4o,i (see Section 3.1.4). Hence: 


Po-l 

= 1 





pop 

= 1 - 

io 




Mo.l 

= 1 - 

^0 — 




Mo.2 

= 1 - 

^0 — 

ti — 

t2 + 

tot2 

Mo,3 

= 1 - 

^0 — 

ti — 

^2 — 

ts + tot2 + tots + tito 

MO.4 

= 1 - 

^0 — 

ti — 

^2 — 

^3 — ^4 H“ ^0^2 H“ ^0^3 ^0^4 “t“ ^1^3 H" ^1^4 “t“ ^2^4 — ^0^2^4 

For 

any i 

e {- 

-1,. 

.,A 

— 2|, the monoid Ain z coincides with the sub-monoid 


as defined in Section 


4.2.2 


where is the following clique of M: 


ai +2 • ai +4 •. ■ o,N , ii N — i = 0 mod 2, 

ai +2 • «i +4 •.• O'N-i ; if i = 1 mod 2 . 


The clique Ci is non empty as long as i < iV — 1. Therefore, according to (10), the 
Mobius conditions on / ensure the positivity of all numbers for z < — 1. 

Let is {0,..., A^ — 2}. Any clique 7 of Af o,i+i either belongs to Mo,i, or contains 
an occurrence of a^+i. In the later case, this clique 7 is of the form 7 = a^+i • 7 ', with 
y ranging over cliques of It follows at once that the following recurrence 

relation holds: 


0 < i < A — 1 (19) 

Similarly, any clique 7 of A4o,n is either contained in A4o,n-2 , or it contains 
an occurrence of oat-i, in which case it writes as 7 = un-i • y with 7 ' ranging 
over cliques of A 4 o,Ar- 3 , or it contains occurrence of otv, in which case it writes as 
7 = qn • y with y ranging over cliques of Alo.Ar- 2 - We deduce: /io,Af = Mo,Af -2 — 
tN-ifJ‘ 0 ,N -3 — tNfJ' 0 ,N- 2 - But /io,Ar is the evaluation on (to, •. •, tw) of the multivariate 
Mobius polynomial of Af = hence fio,N = 0 since / is a Mobius valuation. We 

obtain: 


(1 — tAr)/Zo,Af -2 — — 0 . 


( 20 ) 


Now, since all the numbers fiQ^i for i G { —1,..., A — 2} are positive, we define the 
family {pi)i<i<N as follows: 


pi(ao) = to 

l<z<A pAaj_i)=ti_i- 

Moy -2 

PN{aN-i) = 1 — tN 


Pi(ai) = 1 - to 


( 21 ) 


Ma^) = ( 22 ) 

M 0 ,i -2 

PNiaN) = tN (23) 


All numbers appearing in (21), (22) and (23) are positive, and equations (16) 
and 0 are satisfied. As for (|18[), we write, for 1 < z < A — 1 : 


/ \ t \ P0,i—1, p0,i—2 , 

Pi[ai)pi+i{ai) = - U —-= ti 

p0,i-2 P0,i-1 


For z = A — 1, we have: 


PN-i{aN-i)PN{aN-i) = ^ (1 — tN) = tN-l, 

P0,N-3 
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the later equality by (|20[). We have shown so far that (|16[), (|17|) and (18) are satisfied. 


It remains to see that all (pi(oi_i) pi(ai)) , for i ranging over {2,..., iV — 1}, 
are positive probability vectors, since this is trivially true for i = 1 and for i = N. 
We have already observed that they are all positive vectors. For 1 < i < TV, one has: 


p,{a,) + = 


Mo,z- i + ^i-iM oy-3 
M0,i-2 


= 1 


by virtue of (19). Hence each pi is indeed a positive probability distribution on , 
which completes the proof. □ 


Theorem |5.3| can be adapted to the case of sub-Bernoulli measures instead of 
Bernoulli measures, still for the path model, as follows. In a nutshell: every sub- 
Bernoulli measure can be simulated by synchronization of sub-Bernoulli sequences, 
but there is no uniqueness in the choice of the local sub-probability distributions. 

• Theorem 5.4 —Let P 6 e a sub-Bernoulli measure on A4, where Ai is the synchro¬ 
nization monoid associated with the path model. For i S {0,..., N}, put: 


ti = P(t Oi) . 

Then there exists a tuple (pi,... ,pjv), with p, either a probability or a sub- 
probability distribution over = {ai-i,ai}, such that the random trace ^ G Ai 
produced by the PSA based on (pi,...,pAr) is finite with probability 1 and has the 
distribution P. 


Proof. Using the same notations as in the proof of Theorem 5.3 we define (pi)i<i< 7 v 
as follows: 


pi(ao) = to 

I <i < N pi{ai-i) = 


Pi(ai) = 1 - to 


(24) 


PAf(aw-i) = 


A0,i-3 

A0,i-2 

tN-l 


Pi{ai) = tffLA (25) 

h0,i-2 

PN{a,M)=tM (26) 


PN-i{aN-i) 

The only difference with the definitions introduced in the proof of Theorem |5.3 


lies in (26). As a consequence, all pj(ai_i) and Pi{ai) are positive. They satisfy 
Pi(ai)Pi+i(oi) = ti for all i G {2,..., A^— 1}, and obviously pi(ao) = to and PAr(aAr) = 
tN . What remains to be proved is that the sums pi{ai-i) -l-pi(ai) stay within (0,1] 
for alH e {1,..., A^}. 

For i G {1,..., At — I}, the sum is I, just as in the proof of Theorem 5.3 And for 
z = At, we have: 


PN^aN-l) +PAr(aAr) < I 


► tN+tN-l^^^^^ < 1 
Ao,Ar-2 

AO.iV-2 — tNP0,N-2 — tN-lP0,N-3 > 0 
AAl(tO) • ■ ■ jtjv) > 0 

The last condition is satisfied since P is a sub-Bernoulli measure. The proof is com¬ 
plete. □ 


5.4 — Finitary cases 

5.4.1 Example of a ring model — Consider the ring model already introduced in 
Section [2]2l The trace monoid is: 

A4 = (ao, tti, 02,03 I 00 O 2 = 0200 , 0303 = 0301 ). 


26 












This is the synchronization monoid associated with the network of alphabets (Eg, Ei, E2, E3) 
given by Eg = {03,00}, Ei = {oo,oi}, E2 = {01,02} and E3 = {02,03}. Contrasting 
with the path model, we shall see on an example that the PSA for this ring model 
with four generators produces finite traces with probability 1 . 

Let po,... ,p 3 be uniform distributions on Eg, ..., E3 . According to Theorem |5.H 
the PSA yields a multiplicative and uniform probability measure P on with pa¬ 
rameter p = (l/2)(l/2) = 1/4: 


Vx G J\4 P('fl' x) = 


l\l^l 


(27) 


The Mobius polynomial of Ai is (t) = 1 — 4t -|- 2t^ , whic h roo t of smallest 
modulus is po = 1 — 11^/2. Since 1/4 < pg, it follows from Theorem 4.4 that the PSA 
produces a finite trace y G Ai with probability 1 . 

The av erag e length of y is easily computed. Following the notations introduced 
we put G{z) = P = 1/'^ ^ = Pm{p)- Using that 

'I and G{z) = 1/pik(z), we have: 


4.2 


in Section 


V{y = x) = ep 


E|p| = E 


x^M. 


I im,/ N dG 

|x|P(p = x)= ep— 


= p PMiP) 
b=p] ^Pm{p) 


= 6 . 


5.4.2 Generalization: trees and cycles — Since the probability distribution of the 
random element produced by the PSA is a multiplicative probability measure, we 
know by Theorem 4.3 that the trace is either finite with probability 1 or infinite with 
probability 1. The examples studied above show that both cases may occur indeed. 

It turns out that the dichotomy is solved by a simple criterion, as stated below. 


• Proposition 5.5 —Assume that the synchronization monoid Ai = A4{T,,I) associated 
with a tuple of alphabets (Ei,...,E 7 v) is irreducible, and let D be the dependence 
relation of A4. Let pi,... ,pAr be positive probability distributions on Ei,..., Ejv re¬ 
spectively. Then 


1. The PSA based on (pi,...,p 7 v) produces an infinite trace with probability 1 if 
and only if: 

j |Ui n Ejl < 1 ; and 

b) the non oriented graph (E,Z 1 ) has no cycle. 

2. If the output X of the PSA is finite with probability 1, then the length |A| has 
a finite average. 


Proof. Assume first that both conditions la and lb are met. Let Y = be a 

tuple of infinite sequences Yi G (Ei)“. With probability 1, for any distinct i and j 
such that Ei n Ej 7 ^ 0 , the sequence Yi has infinitely many occurrences of the unique 
element belonging to E^ H Ej. Just as in the case of the path model, this is enough 
to guarantee that the synchronization trace of Y is infinite. 

Conversely, assume that one of conditions and is not met, for instance 
condition la Let i and j be distinct indices such that E^ n Ej has at least two 


distinct elements a and b. Let Y = (Ur)rGE be a random vector of sequences such 
that the synchronization trace of Y is infinite. Since A4 is assumed to be irreducible, 
we observe that, with probability 1, if the synchronization trace of Y is infinite then 
all coordinates of Y are infinite. Hence there is no loss of generality in assuming that 
both coordinate Yi and Yj are infinite. Then the order of occurrences of a and b in 
both coordinates must be the same, which has probability 0 to occur. 
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Finally, assume that condition is not met, hence the presence of a cycle 
(ri, r2 ,..., Tfc) in the dependence relation. We assume without loss of generality 
that are pairwise disjoint. Let oi G H Sr^, 02 G Sr2 F 

Ofe G Ork n E^i. Focusing on the coordinates ri,..., of F only, a pattern of the 
form (oittfe, 0201,0302 ,..., afc_iafc_2, afcOfc-i) shall occur with probability 1. Since 
such a pattern is blocking the synchronization, it follows that the synchronization 
trace is finite with probability 1. 

We have proved so far the equivalence stated in point We now come to the 
proof of point and assume that |X| < cx) with probability 1. The average of |X| is 
computed as the following mathematical expectation: 

E|X| = ^ |a:|P(X = cr) = e \x\fix), 

aiGA1\{e} 


where / is the sub-Mobius valuation associated with P, and e is the constant given 
by the Mobius transform of / evaluated at the empty heap (see Section 4.2.4). Let 
G{\) be the power series: 


G(A) = ^ Al"l/(a:) 

x^M 


Then G(l) = e ^ < c» according to Theorem 4.1 point and since G has non nega¬ 


tive coefficients, it implies by the Pringsheim Theorem that the radius of convergence 
of G is greater than 1. Hence so does its derivative, and thus G'(l) < 00. Since 
E|X| = eG'(l), the result of point [^follows. □ 


6—The Probabilistic Full Synchronization Algorithm 

The PSA produces random traces, either finite or infinite. Proposition |5.5| shows 
that the ability of the PSA to produce finite or infinite traces does not depend on 
the probabilistic parameters one chooses to equip the local alphabets with. It rather 
depends on the structure of the synchronization monoid. 

To be sound, testing procedures and statistical averaging techniques require ar¬ 
bitrary large traces, which binds us to the mathematical model of infinite traces. In 
case where the PSA fails to produce infinite traces, we are thus left with an unsolved 
problem. Yet, we can produce finite traces... and the most natural thing to try from 
there, is to start the PSA over and over, and to concatenate the finite random traces 
obtained at each execution of the PSA. The limiting trace is infinite and random, 
couldn’t it just be the one we were looking for? 

It is quite surprising to realize that this strategy fails in general. Of course the 
unlimited concatenation of finite traces, with a positive average length, necessarily 
produces an infinite trace. But the failure comes from the distribution of this random 
infinite trace. We will show on an example below that it is not a Bernoulli measure 
in general. In particular, the uniform measure is thus unreachable by this technique. 

Nevertheless, we introduce an algorithm based on recursive concatenation of finite 
random traces and that outputs, if executed indefinitely, an infinite trace which is 
always distributed according to a Bernoulli measure. The intermediate, finite random 
traces, are obtained by a trial-and-reject procedure based on the PSA. The whole 
procedure constitutes the Probabilistic Full Synchronization Algorithm (PFSA). In 
Section [6.4[ we show that any Bernoulli measure can be simulated by the output of 
the PFSA for the ring model. 


6.1 — Convolution of probability distributions and random walks 
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6.1.1 Convolution and random walks — We recall the general definition of con¬ 
volution for a countable monoid A4. Let ly and 9 be two probability distributions 
on Ai. Assume that X and Y are two independent random variables with values 
in M., distributed according to v and to 9 respectively. Then the convolution n * 9 is 
the distribution of the random variable X • Y, and it is given by the Cauchy product 
formula: 


Vx € At n * 9{x) = 

{y,z)£M. : y-z—x 


The convolution product is associative. 

Given a probability distribution v over Ad, let {Xn)n>i be a sequence of indepen¬ 
dent random variables with values in At, and identically distributed according to v. 
The random walk associated with n is the sequence of random variables {Yn)n>o de¬ 
fined by lb = G, the unit element of the monoid, and inductively: T„+i = T„ • Al„+i 
for all integers n. If Vn denotes the distribution of lb, we have = ^*(”1 for all 
integers n, the n**' convolution power of n with itself. 

Each trajectory (lb)„>o of the random walk is nondecreasing for the divisibility 
relation in the monoid. Hence, if we assume now that AI is a trace monoid, the 
nondecreasing sequence (Tn)n>i has a least upper bound in the completion A4, say 
dbo = Vn>i db- We introduce the notation iy*°° for the probability distribution of Yaa, 
which we call the limit distribution of the random walk (it is also called the harmonic 
measure of the random walk). 

6.1.2 An example where the concatenation of PSA traces does not yield a limit 
multiplicative measure — We consider the ring synchronization monoid on four gen¬ 
erators AI = (00,01,02,03 I O0O2 = O2O0, O1O3 = O3O1), corresponding to the network 
of alphabets (Eq, Ei, E 2 , E 3 ) with Eg = { 03 , 00 }, Ei = { 00 , 01 }, E 2 = { 01 , 02 } and 
S 3 = { 02 , 03 }. Each alphabet E^, for z = 0,..., 3, is equipped with the uniform prob¬ 
ability distribution. Let be the distribution on dAi of the infinite trace obtained 
by concatenating infinitely many independent copies of a finite trace generated by 
the PSA. Then we claim: the limit distribution v*°° is not Bernoulli. 

Seeking a contradiction, assume that it is. Clearly, zz*°° is concentrated on the 
boundary dAi. And for symmetry reasons, it is necessarily the uniform distribution, 
and thus given by x) = where po is the root of smallest modulus of the 

Mobius polynomial fj,(z) = 1 — iz + 2z^. 

We extend the concatenation of traces x-y with (x,y) G AixAi to the case where 
y is an infinite trace by putting x-y = \/{x-yn : n > 1 } for y = \/{pn '■ n > 1}, and 
this definition does not depend on the choice of the sequence {yn)n>i- This yields also 
an extension of the notion of convolution v * 9 to the case where v is concentrated 
on Ai, but 9 might be a probability distribution on Ai. The construction of iy*°° 
implies the fix point property = 1 /*°°, where Vp{f\ x) = is the distribution 

of the PSA, here given by p = 1/4. In particular for the cylinder f uq , this yields: 

Po = y't'p({a 2 })i^*“(t ao) + i^pihao) =Po(l - dp + 2p‘^)-^ - \-p 

t^o 

Simplifying by p 7 ^ 0, we obtain: p = (3po—I)/(2po —I), and since 1—4po-l-2po = 0, 
it yields p = po, a contradiction. Actually, we have shown the strongest result that 
no random walk based on the distributions x) = pl®l with p G (0,po) has a 

Bernoulli measure as limit distribution 1 ^*°°. 

6.2 — First hitting times and pyramidal heaps 
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6.2.1 First hitting times — First hitting times for random heaps formalize the idea 
of the first time of occurrence of a given piece—yet, without an explicit notion of 
time at hand. It generalizes to random heaps the analogous notion, for a Bernoulli 
sequence, of first time of reaching a given letter. 

Let A4 = A1(E,/) be a trace monoid, and let a S S be a given letter. The 
number of occurrences of a in the congruent words defining a trace x is constant, and 
depends thus only on the trace x. We denote it |a;|a. For any infinite trace ^ S dM, 
let La{^) = {x G M : x < \x\a > 0}. If non empty, the set Lq(C) has a minimum 
which we denote by 14 (^), and it satisfies 114(C) I a = 1- Intuitively, 14(C) represents 
the smallest sub-trace of C with at least an occurrence of a. 

If dM is equipped with a Bernoulli measure P, then La(C) ^ 0 with probability 1. 
Hence, neglecting a set of zero probability, we may assume that 14 : dM -G M is 
well defined. The mapping 14 is called the first hitting time of a. The distribution 
of the first hitting time of a is the probability distribution of the random variable 14. 
It is a discrete probability distribution on Al, which we denote by Pq, and which is 
defined by Pa(x) = P(14 = x) for aW x G M. 

We will base our random generation of infinite heaps on the following result. 

• Theorem 6.1— Let P be a Bernoulli measure equipping the boundary dM of an ir¬ 
reducible trace monoid M = M(S,I). Let a G Pi, and let Pa be the distribution of 
the first hitting time of a. Then P*°“ = P^ where P*°° is the limit distribution of the 
random walk on M associated with Pq. 

Sketch of proof. Let (l/")n>o be the sequence of iterated stopping times associated 
with the first hitting time 14, as defined in Def. 5.2]. Under the probability P, 
it follows from Prop. 5.3] that, for each integer n, U" has the same distribution 
as the n**' step of the random walk associated with Pq. Since M is assumed to be 
irreducible, the sequence (U")n>o is exhaustive as defined in Def. 5.5], from which 
the result derives. □ 


As a consequence, if we can simulate the distribution Pa of the first hitting time 
of some piece a, we will be able to simulate a P-distributed infinite random heap. 
The improvement lies in the fact that first hitting times are finite heaps. Our next 
task consists thus in studying more closely the distribution of the first hitting time, 
after which we shall see how to simulate it. 

6.2.2 Pyramidal heaps and the distribution of the first hitting time — Recall that 
any trace has a interpretation as a labeled partial order of pieces (see Section 3.2.2[ ). 
A trace x is pyramidal if, as labeled partially ordered set, it has a unique maximal 
element (a notion introduced by Viennot 22 ). Any trace of the form x = 14(0 
some ^ G dM is pyramidal, with its unique occurrence of a as its unique maximal 
piece. 

Then the set Va of traces x G M in the image of the mapping 14 : dM -A M can 
be described as follows: Va is the set of pyramidal traces x G M such that the piece a 
only occurs as the unique maximal piece; see Figure]^ Furthermore, we observe that, 
if X e Va, then {14 = x} = t x (an intuitive property, also proved in Prop. 4.2]). 
It follows that the distribution Pa of the first hitting time has the following simple 
expression: 


\/x G M Pa(x) 


0, if X ^ Va 

P(tx), ifxeVa 


(28) 


6.2.3 Generating pyramidal heaps — We consider a network of alphabets (Ei,..., Ejv) 
with E = El U ... Ejv, such that the synchronization trace monoid M = A4(E, J) 
is irreducible. We pick an arbitrary letter a G S. For each i G {!,..., A}, let 
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x = c- b- d- c- b- d- a 


y = c-b- d- c-b ■ a 



Figure 7: (i); For M. = {a,b,c,d : ac = ca, ad = da, bd = db): a trace x ^ Va 
(it is not pyramidal) and its associated labeled poset. (ii); a trace y G Va and its 
associated labeled poset on the right. 


S' = Ei \ {a} if o € Si, and S' = S^ otherwise. Then the synchronization monoid 
M' of (Sj,..., S'jy) coincides with the sub-monoid of M generated by S \ {a}. 

We assume, for each i S A^},to be given p', either a probability distribution 

or a sub-probability distribution over S', such that the output X of the PSA executed 
on (S']^,..., S'^) with those parameters, is finite with probability 1. 

Given the parameters (p^,... ,p'^), we consider the execution of the trial-and- 
reject procedure described in pseudo-code in Algorithm below. At each run of the 
loop, the algorithm needs to decide whether some trace V is pyramidal in Ai or 
not. It is clear that a—far from being optimal—scanning procedure, examining all 
elements of V starting from the right, will successfully complete this job. 


Algorithm 5 Outputting a pyramidal trace in Va 

Require: - 


1 

A ^ e 

t> Initialization 

2 

repeat 


3 

call Algorithm]^ 

[> Calling the PSA 

4 

X y- output of Algorithmic 


5 

V ^X-a 


6 

until V is pyramidal in Ai 

[> See comment 

7 

return V 



By assumption, the PSA which is repeatedly called in Algorithm always termi¬ 
nates and outputs a finite trace. Since the probability of success if positive, since for 
instance X = e yields a success and has positive probability to be output by the PSA, 
Algorithm always terminates. The probability distribution of the output is given 
in the following lemma, stated with the notations introduced above. 

• Lemma 6.2 —Let /' : AA' -y be the sub-Mobius valuation associated with the out¬ 
put of the PSA running on AA'. Then the distribution of the output V of Algorithm\^ 
is concentrated on Va and given by: 

VueVa F(y = v) = K ■ f{v/a), (29) 
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where v/a denotes the heap obtained by removing from v its unique maximal piece a, 
and K is a normalization constant. 


Proof. Let Q denote the probability distribution of the output X of the PSA running 
on M' with the specified parameters ... ,p^). Then f'{x) = x) for x G M'. 
According to Theorem 4.3 for some constant e > 0, we have Q(X = x) = ef'{x) for 
all X G M'. 

The rejection procedure amounts to considering the distribution of 14 = X ■ a 
conditioned on A • a G Vq. Hence the probability for Algorithm to issuing an 
element u € Va is: 


F{V = v) = 


Q(A ■ a = v) Q(A = v/a) 


QiX-aG Va) Q(X-aG VJ 
where K is the constant K = e/Q(A ■ a GVa). 


= K-fiv/a), 


□ 


Note that the form (291 is almost that of a valuation evaluated at v. The contri¬ 


bution of the last piece a is missing, but the constant K is adequately placed to play 
the role of this missing contribution. This will be used in the proof of Theorem |6.3[ 


6.3 — The Probabilistic Full Synchronization Algorithm 


We are now ready for constructing a probabilistic algorithm generating Bernoulli- 
distributed infinite traces. The framework consists of a network (Si,..., Sjv) of 
alphabets, such that the synchronization trace monoid At = Af(E,/) is irreducible, 


with E = El U • • • U Eiv- 

6.3.1 Description of the algorithm — Having chosen an arbitrary piece a € E, we 
consider a family (p/,... ,p'j^) of probabilistic parameters as above, i.e., with the 
constraint that the PSA executed on the sub-monoid A4' generated by E \ {a} and 
with these parameters, outputs a finite trace with probability 1 . 

The Probabilistic Full Synchronization Algorithm (PFSA) is described in pseudo¬ 
code in Algorithm below. The PFSA is an endless loop, incrementally writing to 
its output register X. It simulates thus the random walk on A4 with increments 


distributed according to the distribution established in Lemma 6.2 


Algorithm 6 Probabilistic Full Synchronization Algorithm 

Require: - 

1 ; A ^ e 

[> Initialization 

2 : repeat 

3 : call Algorithmic 

4 ; V G- output of Algorithmic 

5 : X ^ X -V 

> Random pyramidal trace V G Va 
> Increments the random walk 

6 : until false 



The analysis of Algorithm is twofold: a probabilistic analysis carried on below 
and a complexity analysis carried on in Section [ 6 . 3. 3[ 

6.3.2 Probabilistic analysis of the PFSA — Recall that, by convention, the output 
of the PFSA is the random infinite heap, least upper bound in dXi of the finite heaps 
recursively written in its output register. The probability distribution of this infinite 
heap is as follows. 


• Theorem 6.3— We consider the execution of the PFSA in the framework described in 
Section 6.3.1 and we adopt the same notations. Let f '. M! ^ be the sub-Mobius 
valuation associated with the PSA executed with the chosen parameters {pi,... ,p'j^), 
and let X^o be the output (with the convention recalled above) of the PFSA. 
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, is distributed according to a Bernoulli measure on dAi. The associated 
> K is the Mobius valuation on M that extends f : j\4' —)■ K fthe 


Then 

valuation f 

existence and uniqueness of which are stated in Theorem 4-5)- 


Proof. Let / : —>■ be as in the statement. Let P be the Bernoulli measure on 

dAi defined by P(t a;) = f{x), which is well defined according to Theorem 4.3 

Letting Q be the distribution of X^o, we have to prove that P = Q. Let 5 : Va — t 
be the probability distribution of the increment V in the PFSA. Since Q is the 
limit distribution of the random walk on Ai with increments distributed identically 


to V^ it follows from Theorem 6.1 that we only need to show that g{x) = f’iVa = x) 
for all X €Va- 


According to (28), we have: 


Va; e Va F{Va = x) =P{t x) = f{x). 
Whereas, according to Lemma [6. 2 [ we have for some constant K: 


Va; G Va g{x) = Kf'{x/a) = 


K 

/(a) 


fix). 


(30) 


(31) 


Summing up over Va in (30) yields fix) — whereas summing up over Va 


in (31) yields 1 = iK/ f{a)J^Y.x(iV^ fix)) = K/f{a). Therefore K = /(a), which 
yields after re-injecting in ( [sT] ): g{x) = f{x) and thus g{x) = P(Va = x) for all a: G Va. 
Since P has the desired properties, the proof is complete. □ 


6.3.3 Complexity analysis of the PFSA — We will limit our analysis to the following 
observation: the size of the output register of the PFSA grows linearly with time in 
average. 

Indeed, each PSA call in Algorithm takes a finite amount of time in average 
according to Proposition |5.5[ pointj^ Since the probability of success in the trial- 
and-reject procedure of Algorithm l^s positive, it will thus execute in average in a 
fixed amount of time, whence the average linear growth of the output register of the 

PFSA. 

Another question is to compute adequately the probabilistic parameters. We will 
discuss it briefly in Section]^ after having examined some examples. 


6.4 — Example: ring models 

6.4.1 A general result — For the ring models, the following result shows that any 
Bernoulli measure can be simulated by executions of the PFSA. 

• Theorem 6.4 —Let (oq, ..., uat-i) be N distinct symbols, and let (Eg,..., Tjn-i) be 
the network of alphabets defined by: 

0<i<N-l Ei = {ai_i,aJ, Eg = {oat-i, ag} . 

The synchronization monoid Ai = AI(E,/) is described as follows: 

E = {og,..., oat-i} , I = {{ai,aj) : (i — j mod N > 2) A {j — i mod A > 2)} . 

Then any Bernoulli measure on dAi can be simulated by the endless execution of 
the PFSA, and in particular the uniform measure on dAi. 

Proof. Let P be a target Bernoulli measure on dAi, and let / : AI —M!j_ be the 
associated Mobius valuation. We pick og as the piece to be removed. Let AI' be the 
submonoid of AI generated by oi,... ,aN- Then AI' is a path model. Furthermore, 
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Figure 8: (i): synchronization monoid for the ring model with five generators. 

(ii): generators of the path model on which the PSA is run at each loop of the PFSA. 


since A 4 is irreducible, it follows from Theorem | 4 . 5 | point[l]that the restriction of / to 
M' is sub-Mobius. According to Theorem | 5 . 4 | the associated probability distribution 
can be obtained by running the PSA with suitable parameters. Running the PFSA 
based on this instance of the PSA, we generate a Bernoulli measure which Mobius 
valuation, say g, extends to M the restriction of / to M'. By the uniqueness of the 
extension of sub-Mobius valuations to Mobius valuations (Theorem 4.5 point [2]), it 
follows that g = f. □ 


6 . 4.2 Example — We consider the example of the ring model A 4 on five generators, 
the synchronization graph of which is depicted in Figure]^ (i). We aim at generating 
the uniform measure on 9 A 4 , say iz, characterized by x) = ' , where Po = ^ — ^ 

is the root of smallest modulus of pjvt (t) = 1 — 5 t + . 

We pick do as our distinguished piece, as depicted in Figure]^ (ii). Then, we wish 
to run on the sub-monoid 


M' 


(01,02,03,04 I 01O3 = 03O1, 01O4 = 04O1, 0204 = 0402) 


a PSA with associated valuation f : M' —)■ K such that f'{x) = for all x G Ai'. 
Note that pq is not the root of pM' ! 

Referring to the computations performed in the proof of Theorem | 5 . 4 [ adequate 
solutions for p2, ps and p4 , respectively on {oi, 02}, on {02, 03} and on {03, 04} , are 
obtained as follows: 


P2(ai) 

^ 3 ( 02 ) 

^ 4 ( 03 ) 


1 Vs 
“ 2 ~ lO 

Po ^ 3 _ 
1 -po 2 

Poi^-Po) ^ 

1 - 2po 


« 0.276 


2 

1 

7! 


« 0.382 

« 0.447 


P2(a2) = ^~P^ = \ + ^ ~ 0.724 

P 3 K 03 ) = - -= -- -K — « 0.618 

1 - Po 2 2 

^ 4 ( 04 ) = ^^0 = ^ ^ ~ 0.276 


7—Computational issues and perspectives 

When trying to use the PFSA in practice for simulation and testing, one might be 
concerned by the fact that it incrementally outputs heaps with a particular shape, 
namely they are all pyramidal. Furthermore, the tip of these pyramidal heaps is 
always labeled with the same letter, corresponding to an arbitrary choice made before 
executing the algorithm. Actually, there are good reasons not to worry about that. 
Indeed, a large class of statistics on heaps can safely be computed on these particular 
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heaps, and they will asymptotically be indistinguishable from statistics computed on 
arbitrary large heaps. For instance, an asymptotics of the speedup, i.e. the ratio 
height over number of pieces in large heaps, can be estimated in this way. Precise 
results on this topic are found in |^, under the name of cut-invariance. 

Another concern is the following. Even if the PFSA were proved to be able to 
simulate any Bernoulli measure for any topology, not only for the ring topology, 
there is no doubt that its execution would still need the precomputation of adequate 
probabilistic parameters, and in particular the root of smallest modulus of the Mobius 
polynomial of the synchronization monoid. Given that the determination of this 
polynomial on a general synchronization graph is an NP-complete problem (since the 
independence set decision problem is NP-complete i), this precomputation method 
appears unrealistic to be used in practice. 

However, what we need is not the Mobius polynomial itself, but only its root of 
smallest modulus. It is not theoretically forbidden to think that this root might be 
approximated in polynomial time, even though the Mobius polynomial is hard to 
find. Actually, one can even think of a feedback procedure based on our generation 
algorithms to find an approximation of this root. Indeed, we could execute the gener¬ 
ation algorithm with arbitrary parameters, then adjust the probabilistic parameters 
in order to increase uniformity, then re-run the generation algorithm and re-adjust 
the parameters, ans so on. It is reasonable to expect that such a procedure would 
lead the parameters to converge toward the critical value entailing uniformity, which 
is the root of smallest modulus of the Mobius polynomial. This interesting question 
may deserve a dedicated work. 
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